[Quality] Better error message for incongruent lists of keys #1077
Merged
vmoens added a commit that referenced this pull request on Nov 6, 2024
ghstack-source-id: 34940a47d84bcf171bf4511187fcc82df88f801f
Pull Request resolved: #1077
facebook-github-bot added the CLA Signed label on Nov 6, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 35.5060μs | 17.7287μs | 56.4056 KOps/s | 60.2000 KOps/s | |
test_plain_set_stack_nested | 48.9710μs | 17.9870μs | 55.5958 KOps/s | 60.2929 KOps/s | |
test_plain_set_nested_inplace | 82.7320μs | 19.2749μs | 51.8809 KOps/s | 54.3695 KOps/s | |
test_plain_set_stack_nested_inplace | 64.3590μs | 19.7052μs | 50.7480 KOps/s | 54.8633 KOps/s | |
test_items | 34.0340μs | 4.1637μs | 240.1695 KOps/s | 239.7203 KOps/s | |
test_items_nested | 0.7155ms | 0.3390ms | 2.9501 KOps/s | 2.8387 KOps/s | |
test_items_nested_locked | 0.4872ms | 0.3390ms | 2.9502 KOps/s | 2.9418 KOps/s | |
test_items_nested_leaf | 0.1330ms | 71.1832μs | 14.0483 KOps/s | 13.8216 KOps/s | |
test_items_stack_nested | 0.5603ms | 0.3412ms | 2.9310 KOps/s | 2.9377 KOps/s | |
test_items_stack_nested_leaf | 0.1480ms | 74.0077μs | 13.5121 KOps/s | 13.4135 KOps/s | |
test_items_stack_nested_locked | 0.5426ms | 0.3442ms | 2.9052 KOps/s | 2.8997 KOps/s | |
test_keys | 32.8510μs | 3.5929μs | 278.3295 KOps/s | 284.0466 KOps/s | |
test_keys_nested | 0.2231ms | 0.1405ms | 7.1150 KOps/s | 7.0776 KOps/s | |
test_keys_nested_locked | 0.6532ms | 0.1468ms | 6.8101 KOps/s | 6.8591 KOps/s | |
test_keys_nested_leaf | 0.2028ms | 0.1216ms | 8.2243 KOps/s | 8.1876 KOps/s | |
test_keys_stack_nested | 0.2311ms | 0.1397ms | 7.1582 KOps/s | 7.1188 KOps/s | |
test_keys_stack_nested_leaf | 0.2090ms | 0.1187ms | 8.4222 KOps/s | 8.3001 KOps/s | |
test_keys_stack_nested_locked | 0.2449ms | 0.1448ms | 6.9081 KOps/s | 6.8916 KOps/s | |
test_values | 9.2072μs | 1.1226μs | 890.8271 KOps/s | 973.0364 KOps/s | |
test_values_nested | 0.1015ms | 55.8993μs | 17.8893 KOps/s | 18.1948 KOps/s | |
test_values_nested_locked | 0.1118ms | 56.1401μs | 17.8126 KOps/s | 18.2122 KOps/s | |
test_values_nested_leaf | 0.1090ms | 59.9425μs | 16.6827 KOps/s | 15.9846 KOps/s | |
test_values_stack_nested | 0.1118ms | 58.0812μs | 17.2173 KOps/s | 17.2957 KOps/s | |
test_values_stack_nested_leaf | 0.1172ms | 60.8017μs | 16.4469 KOps/s | 16.6767 KOps/s | |
test_values_stack_nested_locked | 0.1116ms | 57.9222μs | 17.2645 KOps/s | 17.8613 KOps/s | |
test_membership | 5.0079μs | 0.7668μs | 1.3041 MOps/s | 1.1000 MOps/s | |
test_membership_nested | 32.2500μs | 2.8182μs | 354.8354 KOps/s | 356.6127 KOps/s | |
test_membership_nested_leaf | 24.8370μs | 2.8282μs | 353.5774 KOps/s | 355.5978 KOps/s | |
test_membership_stacked_nested | 33.3730μs | 2.7641μs | 361.7836 KOps/s | 355.7091 KOps/s | |
test_membership_stacked_nested_leaf | 15.6990μs | 2.7719μs | 360.7687 KOps/s | 358.0223 KOps/s | |
test_membership_nested_last | 34.9250μs | 4.1471μs | 241.1336 KOps/s | 239.3512 KOps/s | |
test_membership_nested_leaf_last | 34.0030μs | 4.1272μs | 242.2924 KOps/s | 239.4576 KOps/s | |
test_membership_stacked_nested_last | 33.8430μs | 5.3294μs | 187.6373 KOps/s | 208.5667 KOps/s | |
test_membership_stacked_nested_leaf_last | 19.6270μs | 5.2884μs | 189.0939 KOps/s | 208.9162 KOps/s | |
test_nested_getleaf | 38.4910μs | 10.5460μs | 94.8229 KOps/s | 91.9924 KOps/s | |
test_nested_get | 37.5900μs | 10.0813μs | 99.1940 KOps/s | 98.0216 KOps/s | |
test_stacked_getleaf | 37.7200μs | 10.4855μs | 95.3697 KOps/s | 93.7733 KOps/s | |
test_stacked_get | 38.2810μs | 9.9898μs | 100.1023 KOps/s | 99.5682 KOps/s | |
test_nested_getitemleaf | 46.1660μs | 11.0292μs | 90.6687 KOps/s | 90.1624 KOps/s | |
test_nested_getitem | 37.2890μs | 10.2202μs | 97.8453 KOps/s | 97.4498 KOps/s | |
test_stacked_getitemleaf | 37.2600μs | 10.9757μs | 91.1105 KOps/s | 89.9784 KOps/s | |
test_stacked_getitem | 42.5390μs | 10.3449μs | 96.6658 KOps/s | 96.5534 KOps/s | |
test_lock_nested | 5.7012ms | 0.4503ms | 2.2208 KOps/s | 2.2843 KOps/s | |
test_lock_stack_nested | 0.6777ms | 0.4110ms | 2.4330 KOps/s | 2.4130 KOps/s | |
test_unlock_nested | 0.7288ms | 0.3646ms | 2.7427 KOps/s | 2.7584 KOps/s | |
test_unlock_stack_nested | 0.5386ms | 0.3303ms | 3.0273 KOps/s | 2.9901 KOps/s | |
test_flatten_speed | 0.1488ms | 92.7463μs | 10.7821 KOps/s | 10.7569 KOps/s | |
test_unflatten_speed | 0.7686ms | 0.4732ms | 2.1133 KOps/s | 2.1153 KOps/s | |
test_common_ops | 6.2783ms | 0.7668ms | 1.3042 KOps/s | 1.4076 KOps/s | |
test_creation | 26.2190μs | 2.0910μs | 478.2307 KOps/s | 482.3252 KOps/s | |
test_creation_empty | 49.6720μs | 10.8118μs | 92.4915 KOps/s | 119.0075 KOps/s | |
test_creation_nested_1 | 47.4080μs | 13.7479μs | 72.7382 KOps/s | 91.4983 KOps/s | |
test_creation_nested_2 | 82.1530μs | 17.7953μs | 56.1948 KOps/s | 65.8700 KOps/s | |
test_clone | 89.3860μs | 13.0985μs | 76.3445 KOps/s | 76.5959 KOps/s | |
test_getitem[int] | 1.5649ms | 12.6062μs | 79.3261 KOps/s | 80.6071 KOps/s | |
test_getitem[slice_int] | 0.1426ms | 23.7646μs | 42.0794 KOps/s | 42.0194 KOps/s | |
test_getitem[range] | 0.1795ms | 48.8712μs | 20.4619 KOps/s | 21.4152 KOps/s | |
test_getitem[tuple] | 0.1290ms | 19.6417μs | 50.9122 KOps/s | 51.5840 KOps/s | |
test_getitem[list] | 0.1785ms | 44.3155μs | 22.5655 KOps/s | 23.5866 KOps/s | |
test_setitem_dim[int] | 48.3900μs | 25.0673μs | 39.8926 KOps/s | 38.9192 KOps/s | |
test_setitem_dim[slice_int] | 90.8290μs | 49.9834μs | 20.0066 KOps/s | 19.3065 KOps/s | |
test_setitem_dim[range] | 0.1229ms | 74.5801μs | 13.4084 KOps/s | 13.8366 KOps/s | |
test_setitem_dim[tuple] | 68.2070μs | 39.0435μs | 25.6124 KOps/s | 24.6646 KOps/s | |
test_setitem | 85.9500μs | 19.8909μs | 50.2742 KOps/s | 54.7712 KOps/s | |
test_set | 79.0170μs | 19.1426μs | 52.2395 KOps/s | 55.3334 KOps/s | |
test_set_shared | 1.1702ms | 0.1666ms | 6.0025 KOps/s | 5.9237 KOps/s | |
test_update | 0.7952ms | 22.1891μs | 45.0671 KOps/s | 52.7807 KOps/s | |
test_update_nested | 82.3530μs | 31.5501μs | 31.6956 KOps/s | 34.5649 KOps/s | |
test_update__nested | 0.1493ms | 31.9376μs | 31.3110 KOps/s | 31.6658 KOps/s | |
test_set_nested | 84.8580μs | 21.2096μs | 47.1484 KOps/s | 50.8771 KOps/s | |
test_set_nested_new | 0.1094ms | 25.8649μs | 38.6625 KOps/s | 40.9269 KOps/s | |
test_select | 0.2203ms | 41.6108μs | 24.0322 KOps/s | 25.5339 KOps/s | |
test_select_nested | 0.1423ms | 60.2109μs | 16.6083 KOps/s | 16.8557 KOps/s | |
test_exclude_nested | 0.1177ms | 75.9672μs | 13.1636 KOps/s | 13.3532 KOps/s | |
test_empty[True] | 0.5456ms | 0.3534ms | 2.8295 KOps/s | 2.8128 KOps/s | |
test_empty[False] | 14.5673μs | 1.3033μs | 767.2588 KOps/s | 798.4231 KOps/s | |
test_unbind_speed | 0.5019ms | 0.2708ms | 3.6924 KOps/s | 3.8473 KOps/s | |
test_unbind_speed_stack0 | 0.4667ms | 0.2614ms | 3.8259 KOps/s | 3.8958 KOps/s | |
test_unbind_speed_stack1 | 0.1011s | 0.7765ms | 1.2878 KOps/s | 1.4269 KOps/s | |
test_split | 1.7910ms | 1.5739ms | 635.3529 Ops/s | 583.6478 Ops/s | |
test_chunk | 0.1014s | 1.8979ms | 526.9111 Ops/s | 578.7770 Ops/s | |
test_consolidate_njt[False-None] | 8.9248ms | 8.3358ms | 119.9650 Ops/s | 123.6269 Ops/s | |
test_creation[device0] | 0.2701ms | 91.6241μs | 10.9142 KOps/s | 10.9469 KOps/s | |
test_creation_from_tensor | 3.8416ms | 95.2757μs | 10.4959 KOps/s | 10.4582 KOps/s | |
test_add_one[memmap_tensor0] | 0.1730ms | 5.1102μs | 195.6861 KOps/s | 208.3096 KOps/s | |
test_contiguous[memmap_tensor0] | 23.5940μs | 0.5253μs | 1.9037 MOps/s | 1.9761 MOps/s | |
test_stack[memmap_tensor0] | 29.2540μs | 3.3527μs | 298.2709 KOps/s | 303.8788 KOps/s | |
test_memmaptd_index | 0.9478ms | 0.2435ms | 4.1064 KOps/s | 4.2859 KOps/s | |
test_memmaptd_index_astensor | 0.7201ms | 0.3222ms | 3.1034 KOps/s | 3.2022 KOps/s | |
test_memmaptd_index_op | 1.0151ms | 0.5912ms | 1.6916 KOps/s | 1.8483 KOps/s | |
test_serialize_model | 0.1245s | 0.1163s | 8.5998 Ops/s | 7.6157 Ops/s | |
test_serialize_model_pickle | 0.4610s | 0.3903s | 2.5621 Ops/s | 2.4835 Ops/s | |
test_serialize_weights | 0.2242s | 0.1310s | 7.6321 Ops/s | 8.8643 Ops/s | |
test_serialize_weights_returnearly | 0.1686s | 0.1607s | 6.2227 Ops/s | 6.5843 Ops/s | |
test_serialize_weights_pickle | 0.6311s | 0.4558s | 2.1941 Ops/s | 2.5443 Ops/s | |
test_serialize_weights_filesystem | 0.1490s | 0.1406s | 7.1119 Ops/s | 7.1272 Ops/s | |
test_serialize_model_filesystem | 0.1644s | 0.1506s | 6.6389 Ops/s | 6.1530 Ops/s | |
test_reshape_pytree | 81.6120μs | 26.7418μs | 37.3947 KOps/s | 36.0513 KOps/s | |
test_reshape_td | 87.4520μs | 32.1421μs | 31.1119 KOps/s | 30.5749 KOps/s | |
test_view_pytree | 63.7990μs | 26.9486μs | 37.1077 KOps/s | 37.5261 KOps/s | |
test_view_td | 0.1011ms | 37.7685μs | 26.4771 KOps/s | 26.8492 KOps/s | |
test_unbind_pytree | 66.0430μs | 30.2227μs | 33.0877 KOps/s | 33.4006 KOps/s | |
test_unbind_td | 0.3222ms | 39.3391μs | 25.4200 KOps/s | 25.7583 KOps/s | |
test_split_pytree | 90.0880μs | 30.2613μs | 33.0455 KOps/s | 33.8097 KOps/s | |
test_split_td | 0.4858ms | 44.2446μs | 22.6016 KOps/s | 22.9935 KOps/s | |
test_add_pytree | 72.5550μs | 36.7179μs | 27.2347 KOps/s | 27.3511 KOps/s | |
test_add_td | 0.1186ms | 53.8141μs | 18.5825 KOps/s | 19.9292 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2439ms | 64.3184μs | 15.5477 KOps/s | 16.0773 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3732ms | 0.1627ms | 6.1446 KOps/s | 6.2876 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1083ms | 46.5243μs | 21.4942 KOps/s | 21.7415 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2399ms | 0.1214ms | 8.2405 KOps/s | 8.4109 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1176ms | 26.6210μs | 37.5643 KOps/s | 39.3398 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1053ms | 54.0539μs | 18.5001 KOps/s | 18.7894 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1468ms | 79.9046μs | 12.5149 KOps/s | 12.6127 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1304ms | 68.7721μs | 14.5408 KOps/s | 14.7550 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2225ms | 0.1060ms | 9.4350 KOps/s | 9.5666 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3334ms | 0.2019ms | 4.9522 KOps/s | 4.9235 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1059ms | 45.1827μs | 22.1324 KOps/s | 21.8517 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4497ms | 61.4332μs | 16.2778 KOps/s | 16.6067 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2331ms | 0.1052ms | 9.5046 KOps/s | 9.7368 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4143ms | 0.2109ms | 4.7407 KOps/s | 5.0168 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3854ms | 0.2134ms | 4.6860 KOps/s | 4.6991 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1825ms | 0.1064ms | 9.4005 KOps/s | 9.4795 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1233ms | 54.2046μs | 18.4486 KOps/s | 18.7807 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 95.8390μs | 45.5742μs | 21.9422 KOps/s | 22.0038 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5933ms | 0.1639ms | 6.1021 KOps/s | 6.3475 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2015ms | 0.1043ms | 9.5891 KOps/s | 9.3294 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 55.1330μs | 21.3119μs | 46.9222 KOps/s | 48.3580 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1229ms | 59.2460μs | 16.8788 KOps/s | 15.9795 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1644ms | 85.3151μs | 11.7213 KOps/s | 12.3039 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1450ms | 72.8638μs | 13.7242 KOps/s | 14.3369 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3644ms | 0.2102ms | 4.7578 KOps/s | 4.8428 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0627ms | 1.2946ms | 772.4627 Ops/s | 782.4049 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3752ms | 0.2086ms | 4.7931 KOps/s | 4.9841 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3521ms | 0.7999ms | 1.2501 KOps/s | 1.3054 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8013ms | 0.4683ms | 2.1352 KOps/s | 2.1400 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.3715ms | 2.6234ms | 381.1854 Ops/s | 399.0769 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 98.0220μs | 36.4809μs | 27.4116 KOps/s | 27.2631 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5838ms | 33.7031μs | 29.6709 KOps/s | 31.6484 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 84.6070μs | 30.0483μs | 33.2797 KOps/s | 34.0752 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1832ms | 23.9402μs | 41.7707 KOps/s | 43.2867 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1200ms | 30.3799μs | 32.9165 KOps/s | 33.3309 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 81.2110μs | 24.1760μs | 41.3633 KOps/s | 43.5952 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1536ms | 52.4998μs | 19.0477 KOps/s | 19.4417 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5777ms | 20.0699μs | 49.8259 KOps/s | 51.1617 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1201ms | 45.3024μs | 22.0739 KOps/s | 22.4237 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 67.9260μs | 18.9718μs | 52.7099 KOps/s | 52.9038 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1447ms | 45.5717μs | 21.9434 KOps/s | 22.1375 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 77.9020μs | 18.9376μs | 52.8049 KOps/s | 51.9187 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1283ms | 53.9094μs | 18.5496 KOps/s | 19.3852 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9406ms | 19.9025μs | 50.2450 KOps/s | 51.8242 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1004ms | 45.9777μs | 21.7497 KOps/s | 22.6064 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 68.1670μs | 19.3348μs | 51.7201 KOps/s | 54.1685 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1127ms | 46.2440μs | 21.6244 KOps/s | 22.5784 KOps/s | |
test_compile_indexing[int-pytree-eager] | 78.5960μs | 19.0504μs | 52.4923 KOps/s | 54.0501 KOps/s | |
test_mod_add[eager] | 75.5200μs | 26.6177μs | 37.5690 KOps/s | 41.9757 KOps/s | |
test_mod_add[compile] | 0.1097ms | 45.7086μs | 21.8777 KOps/s | 21.9189 KOps/s | |
test_mod_add[compile-overhead] | 0.1336ms | 46.4343μs | 21.5358 KOps/s | 22.6114 KOps/s | |
test_mod_wrap[eager] | 0.4446ms | 0.2169ms | 4.6103 KOps/s | 4.7853 KOps/s | |
test_mod_wrap[compile] | 1.7393ms | 0.2085ms | 4.7963 KOps/s | 4.9876 KOps/s | |
test_mod_wrap[compile-overhead] | 1.7946ms | 0.2063ms | 4.8464 KOps/s | 4.7029 KOps/s | |
test_mod_wrap_and_backward[eager] | 21.7749ms | 12.4455ms | 80.3506 Ops/s | 88.1953 Ops/s | |
test_mod_wrap_and_backward[compile] | 18.8910ms | 12.6831ms | 78.8449 Ops/s | 74.1376 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.2256ms | 11.9939ms | 83.3756 Ops/s | 75.9622 Ops/s | |
test_seq_add[eager] | 0.1764ms | 92.6751μs | 10.7904 KOps/s | 11.8210 KOps/s | |
test_seq_add[compile] | 0.1373ms | 61.6539μs | 16.2196 KOps/s | 16.6889 KOps/s | |
test_seq_add[compile-overhead] | 0.1786ms | 60.6318μs | 16.4930 KOps/s | 16.7977 KOps/s | |
test_seq_wrap[eager] | 0.7122ms | 0.3905ms | 2.5606 KOps/s | 2.6427 KOps/s | |
test_seq_wrap[compile] | 0.4285ms | 0.2312ms | 4.3252 KOps/s | 4.4553 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3935ms | 0.2306ms | 4.3370 KOps/s | 4.4763 KOps/s | |
test_func_call_runtime[False-eager] | 0.7312ms | 0.5526ms | 1.8097 KOps/s | 1.8884 KOps/s | |
test_func_call_runtime[False-compile] | 0.5655ms | 0.4331ms | 2.3089 KOps/s | 2.3276 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8477ms | 0.4348ms | 2.2997 KOps/s | 2.3470 KOps/s | |
test_func_call_runtime[True-eager] | 1.2729ms | 0.7697ms | 1.2992 KOps/s | 1.3517 KOps/s | |
test_func_call_runtime[True-compile] | 0.8154ms | 0.4715ms | 2.1209 KOps/s | 2.1468 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9948ms | 0.4735ms | 2.1121 KOps/s | 2.1480 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.1829ms | 0.5584ms | 1.7909 KOps/s | 1.8882 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7821ms | 0.4303ms | 2.3242 KOps/s | 2.3653 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7824ms | 0.4296ms | 2.3279 KOps/s | 2.3299 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.5107ms | 0.9040ms | 1.1062 KOps/s | 1.1441 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8050ms | 0.4984ms | 2.0066 KOps/s | 2.0096 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6031ms | 0.4945ms | 2.0223 KOps/s | 2.0317 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.7126ms | 1.9202ms | 520.7659 Ops/s | 535.6792 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9109ms | 0.5284ms | 1.8925 KOps/s | 1.9129 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9293ms | 0.5295ms | 1.8885 KOps/s | 1.9342 KOps/s | |
test_distributed | 0.3219ms | 0.1282ms | 7.7984 KOps/s | 7.7012 KOps/s | |
test_tdmodule | 60.0120μs | 18.3585μs | 54.4706 KOps/s | 56.3563 KOps/s | |
test_tdmodule_dispatch | 68.7170μs | 35.7605μs | 27.9638 KOps/s | 28.7370 KOps/s | |
test_tdseq | 47.0770μs | 20.9232μs | 47.7938 KOps/s | 50.5921 KOps/s | |
test_tdseq_dispatch | 82.6540μs | 40.7365μs | 24.5480 KOps/s | 25.7711 KOps/s | |
test_instantiation_functorch | 2.4289ms | 1.5748ms | 635.0176 Ops/s | 657.3026 Ops/s | |
test_exec_functorch | 0.3664ms | 0.1824ms | 5.4825 KOps/s | 5.5575 KOps/s | |
test_exec_functional_call | 0.4271ms | 0.1737ms | 5.7575 KOps/s | 5.7482 KOps/s | |
test_exec_td_decorator | 0.5350ms | 0.2303ms | 4.3414 KOps/s | 4.4024 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8925ms | 0.6396ms | 1.5635 KOps/s | 1.5844 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0310ms | 0.6420ms | 1.5578 KOps/s | 1.6005 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7219ms | 0.5264ms | 1.8996 KOps/s | 1.9351 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8318ms | 0.5280ms | 1.8939 KOps/s | 1.9298 KOps/s | |
test_to_module_speed[True] | 1.4998ms | 1.3202ms | 757.4639 Ops/s | 767.6307 Ops/s | |
test_to_module_speed[False] | 2.0482ms | 1.2926ms | 773.6465 Ops/s | 786.1529 Ops/s | |
test_tc_init | 82.8540μs | 44.9841μs | 22.2301 KOps/s | 24.8824 KOps/s | |
test_tc_init_nested | 0.1509ms | 89.6982μs | 11.1485 KOps/s | 12.5249 KOps/s | |
test_tc_first_layer_tensor | 39.1930μs | 1.5534μs | 643.7531 KOps/s | 674.8370 KOps/s | |
test_tc_first_layer_nontensor | 25.1670μs | 4.7421μs | 210.8779 KOps/s | 216.2479 KOps/s | |
test_tc_second_layer_tensor | 44.1030μs | 2.9277μs | 341.5654 KOps/s | 363.9477 KOps/s | |
test_tc_second_layer_nontensor | 36.5780μs | 6.0666μs | 164.8365 KOps/s | 166.4325 KOps/s | |
test_unbind | 0.2368s | 15.2605ms | 65.5288 Ops/s | 80.1158 Ops/s | |
test_full_like | 9.8212ms | 8.1753ms | 122.3196 Ops/s | 130.2432 Ops/s | |
test_zeros_like | 4.1974ms | 3.2412ms | 308.5284 Ops/s | 145.3925 Ops/s | |
test_ones_like | 4.0240ms | 3.5565ms | 281.1784 Ops/s | 123.3444 Ops/s | |
test_clone | 8.2610ms | 5.6871ms | 175.8358 Ops/s | 103.3186 Ops/s | |
test_squeeze | 63.6690μs | 12.2968μs | 81.3220 KOps/s | 83.1657 KOps/s | |
test_unsqueeze | 0.3201ms | 91.7510μs | 10.8991 KOps/s | 11.2268 KOps/s | |
test_split | 0.4355ms | 0.1887ms | 5.2991 KOps/s | 5.2944 KOps/s | |
test_permute | 0.3647ms | 0.2195ms | 4.5553 KOps/s | 4.5622 KOps/s | |
test_stack | 31.0928ms | 27.2412ms | 36.7091 Ops/s | 39.4929 Ops/s | |
test_cat | 31.8017ms | 27.1672ms | 36.8090 Ops/s | 39.5785 Ops/s |
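
The Change column in the table is empty, so the difference between a run and its baseline has to be worked out by hand. A minimal sketch of that arithmetic follows, assuming that "Ops" is the throughput measured on this pull request and "Ops on Repo HEAD" is the baseline. The helper names `parse_ops` and `relative_change` are hypothetical and are not part of the benchmark tooling.

```python
from typing import Tuple

# Multipliers for the throughput units that appear in the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '56.4056 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(row: str) -> Tuple[str, float]:
    """Return (test name, percent change of Ops relative to Ops on Repo HEAD).

    Assumes the column order used in the table:
    Name | Max | Mean | Ops | Ops on Repo HEAD | Change
    """
    cells = [c.strip() for c in row.split("|")]
    name, ops, head = cells[0], parse_ops(cells[3]), parse_ops(cells[4])
    return name, (ops - head) / head * 100.0


# Three rows copied verbatim from the table above.
rows = [
    "test_membership | 5.0079μs | 0.7668μs | 1.3041 MOps/s | 1.1000 MOps/s | |",
    "test_creation_empty | 49.6720μs | 10.8118μs | 92.4915 KOps/s | 119.0075 KOps/s | |",
    "test_zeros_like | 4.1974ms | 3.2412ms | 308.5284 Ops/s | 145.3925 Ops/s | |",
]

for row in rows:
    name, change = relative_change(row)
    print(f"{name}: {change:+.1f}%")
```

A positive value means the run was faster than the baseline, a negative value slower. Each figure comes from a single benchmark run, so small differences may be noise rather than a real change.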

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 30.2600μs | 10.6968μs | 93.4861 KOps/s | 93.7473 KOps/s | |
test_plain_set_stack_nested | 33.8810μs | 10.8173μs | 92.4445 KOps/s | 93.0594 KOps/s | |
test_plain_set_nested_inplace | 0.1200ms | 11.7163μs | 85.3515 KOps/s | 86.0545 KOps/s | |
test_plain_set_stack_nested_inplace | 73.8810μs | 11.6196μs | 86.0616 KOps/s | 86.2080 KOps/s | |
test_items | 35.1310μs | 2.8858μs | 346.5296 KOps/s | 345.2307 KOps/s | |
test_items_nested | 0.3669ms | 0.3177ms | 3.1471 KOps/s | 3.1343 KOps/s | |
test_items_nested_locked | 0.4247ms | 0.3201ms | 3.1240 KOps/s | 3.1080 KOps/s | |
test_items_nested_leaf | 0.2293ms | 58.0325μs | 17.2317 KOps/s | 17.2449 KOps/s | |
test_items_stack_nested | 0.4926ms | 0.3190ms | 3.1348 KOps/s | 3.1236 KOps/s | |
test_items_stack_nested_leaf | 0.2324ms | 59.3958μs | 16.8362 KOps/s | 17.0688 KOps/s | |
test_items_stack_nested_locked | 0.4925ms | 0.3260ms | 3.0677 KOps/s | 3.1147 KOps/s | |
test_keys | 29.7610μs | 3.4562μs | 289.3322 KOps/s | 274.8032 KOps/s | |
test_keys_nested | 0.2550ms | 70.1649μs | 14.2521 KOps/s | 14.1591 KOps/s | |
test_keys_nested_locked | 0.7587ms | 75.4310μs | 13.2572 KOps/s | 13.0998 KOps/s | |
test_keys_nested_leaf | 91.8910μs | 61.5048μs | 16.2589 KOps/s | 16.1461 KOps/s | |
test_keys_stack_nested | 0.1044ms | 70.8285μs | 14.1186 KOps/s | 14.0356 KOps/s | |
test_keys_stack_nested_leaf | 0.1072ms | 61.9022μs | 16.1545 KOps/s | 16.1150 KOps/s | |
test_keys_stack_nested_locked | 0.2028ms | 75.9155μs | 13.1725 KOps/s | 13.0238 KOps/s | |
test_values | 7.5885μs | 0.8419μs | 1.1878 MOps/s | 1.1847 MOps/s | |
test_values_nested | 66.4910μs | 31.4193μs | 31.8276 KOps/s | 32.1473 KOps/s | |
test_values_nested_locked | 64.9610μs | 32.7059μs | 30.5755 KOps/s | 30.5962 KOps/s | |
test_values_nested_leaf | 96.1020μs | 33.7108μs | 29.6641 KOps/s | 29.5883 KOps/s | |
test_values_stack_nested | 69.7310μs | 31.6601μs | 31.5855 KOps/s | 31.6664 KOps/s | |
test_values_stack_nested_leaf | 71.0810μs | 34.1626μs | 29.2717 KOps/s | 29.6287 KOps/s | |
test_values_stack_nested_locked | 82.7410μs | 33.2408μs | 30.0835 KOps/s | 30.0197 KOps/s | |
test_membership | 4.0791μs | 0.5088μs | 1.9655 MOps/s | 1.9504 MOps/s | |
test_membership_nested | 16.1100μs | 1.9199μs | 520.8592 KOps/s | 507.9010 KOps/s | |
test_membership_nested_leaf | 14.0267μs | 1.8712μs | 534.4187 KOps/s | 530.0153 KOps/s | |
test_membership_stacked_nested | 22.2400μs | 1.9544μs | 511.6656 KOps/s | 503.9158 KOps/s | |
test_membership_stacked_nested_leaf | 42.3610μs | 1.9989μs | 500.2811 KOps/s | 507.7895 KOps/s | |
test_membership_nested_last | 39.2700μs | 2.8274μs | 353.6855 KOps/s | 353.5632 KOps/s | |
test_membership_nested_leaf_last | 37.8910μs | 2.8493μs | 350.9577 KOps/s | 352.1119 KOps/s | |
test_membership_stacked_nested_last | 31.6010μs | 4.8175μs | 207.5778 KOps/s | 350.4738 KOps/s | |
test_membership_stacked_nested_leaf_last | 73.8120μs | 4.8097μs | 207.9112 KOps/s | 353.3274 KOps/s | |
test_nested_getleaf | 28.0800μs | 6.0135μs | 166.2912 KOps/s | 165.0153 KOps/s | |
test_nested_get | 38.8500μs | 5.7326μs | 174.4396 KOps/s | 174.8017 KOps/s | |
test_stacked_getleaf | 34.3710μs | 6.0324μs | 165.7704 KOps/s | 164.5791 KOps/s | |
test_stacked_get | 56.1710μs | 5.7381μs | 174.2749 KOps/s | 174.6535 KOps/s | |
test_nested_getitemleaf | 59.8310μs | 6.1012μs | 163.9015 KOps/s | 162.6326 KOps/s | |
test_nested_getitem | 28.9000μs | 5.7659μs | 173.4331 KOps/s | 172.6132 KOps/s | |
test_stacked_getitemleaf | 36.6700μs | 6.1021μs | 163.8778 KOps/s | 163.2860 KOps/s | |
test_stacked_getitem | 32.8600μs | 5.7790μs | 173.0403 KOps/s | 172.4611 KOps/s | |
test_lock_nested | 1.1314ms | 0.3717ms | 2.6905 KOps/s | 2.7201 KOps/s | |
test_lock_stack_nested | 0.4190ms | 0.3387ms | 2.9523 KOps/s | 2.9362 KOps/s | |
test_unlock_nested | 0.9971ms | 0.3121ms | 3.2040 KOps/s | 3.2479 KOps/s | |
test_unlock_stack_nested | 0.4159ms | 0.2785ms | 3.5903 KOps/s | 3.5795 KOps/s | |
test_flatten_speed | 0.1093ms | 72.8530μs | 13.7263 KOps/s | 13.9089 KOps/s | |
test_unflatten_speed | 0.3376ms | 0.2964ms | 3.3733 KOps/s | 3.4637 KOps/s | |
test_common_ops | 93.9528ms | 0.6836ms | 1.4628 KOps/s | 1.6665 KOps/s | |
test_creation | 0.1044ms | 1.4875μs | 672.2634 KOps/s | 680.3091 KOps/s | |
test_creation_empty | 41.7810μs | 7.8917μs | 126.7151 KOps/s | 131.9778 KOps/s | |
test_creation_nested_1 | 29.1410μs | 9.3965μs | 106.4224 KOps/s | 110.5683 KOps/s | |
test_creation_nested_2 | 44.0100μs | 11.8855μs | 84.1359 KOps/s | 85.8615 KOps/s | |
test_clone | 83.4310μs | 10.6807μs | 93.6267 KOps/s | 92.3588 KOps/s | |
test_getitem[int] | 1.5288ms | 11.1798μs | 89.4472 KOps/s | 90.0601 KOps/s | |
test_getitem[slice_int] | 0.1156ms | 23.1505μs | 43.1956 KOps/s | 46.4019 KOps/s | |
test_getitem[range] | 0.2184ms | 43.7420μs | 22.8613 KOps/s | 25.9231 KOps/s | |
test_getitem[tuple] | 0.2120ms | 19.8974μs | 50.2579 KOps/s | 53.6210 KOps/s | |
test_getitem[list] | 0.2030ms | 39.0330μs | 25.6194 KOps/s | 28.9461 KOps/s | |
test_setitem_dim[int] | 43.0600μs | 21.0858μs | 47.4253 KOps/s | 50.6924 KOps/s | |
test_setitem_dim[slice_int] | 68.4110μs | 41.1780μs | 24.2848 KOps/s | 25.3953 KOps/s | |
test_setitem_dim[range] | 0.2322ms | 59.2750μs | 16.8705 KOps/s | 18.1629 KOps/s | |
test_setitem_dim[tuple] | 0.2070ms | 35.7067μs | 28.0060 KOps/s | 29.6580 KOps/s | |
test_setitem | 0.1939ms | 15.3444μs | 65.1702 KOps/s | 65.4970 KOps/s | |
test_set | 0.2046ms | 14.8882μs | 67.1673 KOps/s | 66.7760 KOps/s | |
test_set_shared | 1.7705ms | 0.1491ms | 6.7072 KOps/s | 6.7375 KOps/s | |
test_update | 0.3269ms | 17.7050μs | 56.4811 KOps/s | 56.9642 KOps/s | |
test_update_nested | 0.2085ms | 23.4370μs | 42.6676 KOps/s | 45.5094 KOps/s | |
test_update__nested | 0.9921ms | 24.9687μs | 40.0501 KOps/s | 40.2242 KOps/s | |
test_set_nested | 0.1247ms | 15.9423μs | 62.7263 KOps/s | 62.0567 KOps/s | |
test_set_nested_new | 97.6410μs | 18.4510μs | 54.1975 KOps/s | 55.0399 KOps/s | |
test_select | 0.2112ms | 31.8102μs | 31.4364 KOps/s | 32.5363 KOps/s | |
test_select_nested | 0.1501ms | 41.8311μs | 23.9057 KOps/s | 23.8648 KOps/s | |
test_exclude_nested | 87.3410μs | 59.5302μs | 16.7982 KOps/s | 16.9406 KOps/s | |
test_empty[True] | 0.6175ms | 0.2570ms | 3.8907 KOps/s | 3.9082 KOps/s | |
test_empty[False] | 3.3990μs | 0.7368μs | 1.3572 MOps/s | 1.3453 MOps/s | |
test_to | 84.8610μs | 56.5809μs | 17.6738 KOps/s | 17.9108 KOps/s | |
test_to_nonblocking | 0.2196ms | 48.2288μs | 20.7345 KOps/s | 20.8229 KOps/s | |
test_unbind_speed | 1.6770ms | 0.2362ms | 4.2333 KOps/s | 4.2843 KOps/s | |
test_unbind_speed_stack0 | 0.3696ms | 0.2339ms | 4.2754 KOps/s | 4.2843 KOps/s | |
test_unbind_speed_stack1 | 91.6925ms | 0.6470ms | 1.5457 KOps/s | 1.5303 KOps/s | |
test_split | 94.3599ms | 1.6103ms | 620.9917 Ops/s | 572.6692 Ops/s | |
test_chunk | 93.6143ms | 1.6154ms | 619.0555 Ops/s | 678.4630 Ops/s | |
test_consolidate[False-None] | 96.9454ms | 2.8809ms | 347.1135 Ops/s | 343.7817 Ops/s | |
test_consolidate[default-None] | 1.8294ms | 1.6463ms | 607.4317 Ops/s | 604.5958 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8768ms | 1.7013ms | 587.7766 Ops/s | 592.6926 Ops/s | |
test_consolidate_njt[False-None] | 7.0160ms | 6.7831ms | 147.4262 Ops/s | 149.1160 Ops/s | |
test_to[False-False-None] | 2.0232ms | 1.7433ms | 573.6389 Ops/s | 574.5831 Ops/s | |
test_to[True-False-None] | 1.6382ms | 1.3906ms | 719.1343 Ops/s | 726.4584 Ops/s | |
test_to[within-False-None] | 4.4532ms | 4.1889ms | 238.7242 Ops/s | 240.0281 Ops/s | |
test_to[True-default-None] | 6.1463ms | 5.5655ms | 179.6771 Ops/s | 190.6434 Ops/s | |
test_to_njt[False-False-None] | 7.6517ms | 7.2285ms | 138.3405 Ops/s | 140.6843 Ops/s | |
test_to_njt[True-False-None] | 6.0578ms | 5.6768ms | 176.1543 Ops/s | 175.7311 Ops/s | |
test_to_njt[within-False-None] | 13.3236ms | 12.5593ms | 79.6224 Ops/s | 79.7344 Ops/s | |
test_creation[device0] | 0.5413ms | 81.1880μs | 12.3171 KOps/s | 12.4792 KOps/s | |
test_creation_from_tensor | 0.5059ms | 85.0310μs | 11.7604 KOps/s | 11.8567 KOps/s | |
test_add_one[memmap_tensor0] | 0.2377ms | 6.9971μs | 142.9159 KOps/s | 141.0380 KOps/s | |
test_contiguous[memmap_tensor0] | 1.7790μs | 0.4306μs | 2.3226 MOps/s | 2.3149 MOps/s | |
test_stack[memmap_tensor0] | 0.1524ms | 4.6689μs | 214.1838 KOps/s | 211.2203 KOps/s | |
test_memmaptd_index | 0.5989ms | 0.2631ms | 3.8008 KOps/s | 3.7614 KOps/s | |
test_memmaptd_index_astensor | 0.6205ms | 0.3212ms | 3.1130 KOps/s | 3.0676 KOps/s | |
test_memmaptd_index_op | 1.0210ms | 0.6059ms | 1.6503 KOps/s | 1.6516 KOps/s | |
test_serialize_model | 0.1315s | 0.1305s | 7.6636 Ops/s | 7.6502 Ops/s | |
test_serialize_model_pickle | 1.3489s | 1.1856s | 0.8435 Ops/s | 0.8242 Ops/s | |
test_serialize_weights | 0.1308s | 0.1299s | 7.6961 Ops/s | 7.6712 Ops/s | |
test_serialize_weights_returnearly | 0.4986s | 70.7026ms | 14.1437 Ops/s | 11.4118 Ops/s | |
test_serialize_weights_pickle | 1.4266s | 1.2288s | 0.8138 Ops/s | 0.8390 Ops/s | |
test_reshape_pytree | 0.1714ms | 22.7370μs | 43.9811 KOps/s | 44.1881 KOps/s | |
test_reshape_td | 0.1127ms | 27.1200μs | 36.8732 KOps/s | 36.2601 KOps/s | |
test_view_pytree | 0.1166ms | 22.5568μs | 44.3325 KOps/s | 44.2455 KOps/s | |
test_view_td | 0.1132ms | 31.6150μs | 31.6305 KOps/s | 31.9939 KOps/s | |
test_unbind_pytree | 0.1340ms | 28.7224μs | 34.8160 KOps/s | 35.0979 KOps/s | |
test_unbind_td | 0.9026ms | 36.0296μs | 27.7549 KOps/s | 25.3295 KOps/s | |
test_split_pytree | 0.1620ms | 30.1544μs | 33.1627 KOps/s | 32.9190 KOps/s | |
test_split_td | 0.9729ms | 39.8147μs | 25.1164 KOps/s | 24.4085 KOps/s | |
test_add_pytree | 0.1654ms | 35.1173μs | 28.4760 KOps/s | 27.5815 KOps/s | |
test_add_td | 0.2438ms | 46.2695μs | 21.6125 KOps/s | 20.9735 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.3092ms | 0.1276ms | 7.8374 KOps/s | 7.9632 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2737ms | 0.1253ms | 7.9830 KOps/s | 7.8324 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2460ms | 99.1460μs | 10.0861 KOps/s | 9.8040 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3241ms | 0.1543ms | 6.4806 KOps/s | 6.3511 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1999ms | 24.1621μs | 41.3872 KOps/s | 42.4644 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1355ms | 26.6643μs | 37.5033 KOps/s | 36.6481 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2102ms | 64.8373μs | 15.4232 KOps/s | 15.4153 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1955ms | 49.0278μs | 20.3966 KOps/s | 19.9708 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3310ms | 0.1497ms | 6.6782 KOps/s | 6.4960 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3703ms | 0.2106ms | 4.7484 KOps/s | 4.7526 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2951ms | 0.1041ms | 9.6084 KOps/s | 10.0061 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.2393ms | 54.6157μs | 18.3097 KOps/s | 18.7456 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2948ms | 0.1474ms | 6.7863 KOps/s | 6.5158 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6964ms | 0.5028ms | 1.9887 KOps/s | 1.8731 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4154ms | 0.2510ms | 3.9843 KOps/s | 3.9837 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3429ms | 0.1462ms | 6.8382 KOps/s | 6.8206 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2691ms | 63.8121μs | 15.6710 KOps/s | 15.7078 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2462ms | 0.1004ms | 9.9602 KOps/s | 10.0180 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6218ms | 0.4230ms | 2.3642 KOps/s | 2.3446 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2949ms | 0.1430ms | 6.9930 KOps/s | 7.0630 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1632ms | 19.9514μs | 50.1218 KOps/s | 52.0812 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1308ms | 26.8189μs | 37.2871 KOps/s | 37.1817 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2001ms | 70.0236μs | 14.2809 KOps/s | 14.3648 KOps/s | |
test_compile_copy_flat[pytree-eager] | 82.0610μs | 51.6348μs | 19.3668 KOps/s | 19.3351 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6734ms | 0.3996ms | 2.5023 KOps/s | 2.1543 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.1234ms | 2.7923ms | 358.1254 Ops/s | 354.2340 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6101ms | 0.4399ms | 2.2734 KOps/s | 2.2055 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.1941ms | 2.8022ms | 356.8630 Ops/s | 351.6482 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3288ms | 0.1244ms | 8.0408 KOps/s | 8.3875 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5938ms | 86.3579μs | 11.5797 KOps/s | 11.5154 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3078ms | 0.1159ms | 8.6298 KOps/s | 8.9319 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2555ms | 74.1303μs | 13.4898 KOps/s | 13.9153 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2978ms | 0.1137ms | 8.7986 KOps/s | 8.8496 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2556ms | 74.4684μs | 13.4285 KOps/s | 13.5643 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.3301ms | 0.1079ms | 9.2684 KOps/s | 9.6353 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2233ms | 18.1779μs | 55.0120 KOps/s | 48.0995 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2753ms | 0.1021ms | 9.7981 KOps/s | 10.0277 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1627ms | 16.0402μs | 62.3434 KOps/s | 61.8433 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2868ms | 0.1012ms | 9.8791 KOps/s | 9.9591 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1597ms | 16.0985μs | 62.1177 KOps/s | 61.6286 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.3074ms | 0.1081ms | 9.2504 KOps/s | 9.5827 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5826ms | 17.7408μs | 56.3673 KOps/s | 56.0950 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2995ms | 0.1040ms | 9.6178 KOps/s | 9.9697 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1648ms | 16.1247μs | 62.0168 KOps/s | 61.0681 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2613ms | 99.7593μs | 10.0241 KOps/s | 9.9510 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1568ms | 16.0372μs | 62.3551 KOps/s | 61.5870 KOps/s | |
test_mod_add[eager] | 0.1912ms | 32.7897μs | 30.4974 KOps/s | 30.7110 KOps/s | |
test_mod_add[compile] | 0.2278ms | 78.6156μs | 12.7201 KOps/s | 12.4856 KOps/s | |
test_mod_add[compile-overhead] | 0.3119ms | 0.1664ms | 6.0087 KOps/s | 5.6961 KOps/s | |
test_mod_wrap[eager] | 0.4377ms | 0.2509ms | 3.9854 KOps/s | 3.9386 KOps/s | |
test_mod_wrap[compile] | 1.5818ms | 0.2994ms | 3.3404 KOps/s | 3.4418 KOps/s | |
test_mod_wrap[compile-overhead] | 7.6891ms | 4.0866ms | 244.7026 Ops/s | 249.7799 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6648ms | 1.3893ms | 719.7669 Ops/s | 670.1383 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4836ms | 1.2830ms | 779.4342 Ops/s | 771.7806 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3802ms | 0.9158ms | 1.0919 KOps/s | 1.0665 KOps/s | |
test_seq_add[eager] | 0.5152ms | 98.4599μs | 10.1564 KOps/s | 10.0898 KOps/s | |
test_seq_add[compile] | 0.4829ms | 88.2810μs | 11.3275 KOps/s | 11.1509 KOps/s | |
test_seq_add[compile-overhead] | 0.2760ms | 0.1297ms | 7.7095 KOps/s | 7.7251 KOps/s | |
test_seq_wrap[eager] | 0.7962ms | 0.3918ms | 2.5523 KOps/s | 2.5162 KOps/s | |
test_seq_wrap[compile] | 0.7316ms | 0.3204ms | 3.1216 KOps/s | 3.2487 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3909ms | 0.2237ms | 4.4695 KOps/s | 4.3892 KOps/s | |
test_func_call_runtime[False-eager] | 1.2304ms | 0.8146ms | 1.2275 KOps/s | 1.2975 KOps/s | |
test_func_call_runtime[False-compile] | 1.1683ms | 0.7578ms | 1.3196 KOps/s | 1.3014 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7576ms | 0.3630ms | 2.7545 KOps/s | 2.7366 KOps/s | |
test_func_call_runtime[True-eager] | 1.3500ms | 0.9261ms | 1.0798 KOps/s | 1.0660 KOps/s | |
test_func_call_runtime[True-compile] | 1.1977ms | 0.7790ms | 1.2836 KOps/s | 1.2653 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5353ms | 0.3870ms | 2.5841 KOps/s | 2.5990 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.2093ms | 0.7742ms | 1.2917 KOps/s | 1.2708 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1728ms | 0.7721ms | 1.2952 KOps/s | 1.2443 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5616ms | 0.3651ms | 2.7393 KOps/s | 2.7117 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4343ms | 1.0283ms | 972.4348 Ops/s | 966.7460 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2125ms | 0.8181ms | 1.2223 KOps/s | 1.2158 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5655ms | 0.4119ms | 2.4280 KOps/s | 2.4050 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6758ms | 2.1275ms | 470.0312 Ops/s | 468.1666 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0857ms | 0.8491ms | 1.1777 KOps/s | 1.1995 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5588ms | 0.4165ms | 2.4008 KOps/s | 2.4159 KOps/s | |
test_distributed | 5.4844ms | 0.2699ms | 3.7045 KOps/s | 8.4105 KOps/s | |
test_tdmodule | 52.0010μs | 14.0712μs | 71.0672 KOps/s | 71.8202 KOps/s | |
test_tdmodule_dispatch | 0.1423ms | 27.8612μs | 35.8922 KOps/s | 36.4481 KOps/s | |
test_tdseq | 32.9900μs | 15.2518μs | 65.5660 KOps/s | 62.5510 KOps/s | |
test_tdseq_dispatch | 0.1314ms | 30.4899μs | 32.7978 KOps/s | 31.5678 KOps/s | |
test_instantiation_functorch | 1.7105ms | 1.5583ms | 641.7080 Ops/s | 638.0219 Ops/s | |
test_exec_functorch | 0.2738ms | 0.1531ms | 6.5310 KOps/s | 6.5210 KOps/s | |
test_exec_functional_call | 0.2739ms | 0.1427ms | 7.0100 KOps/s | 6.8617 KOps/s | |
test_exec_td_decorator | 0.4114ms | 0.1908ms | 5.2408 KOps/s | 5.2365 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8854ms | 0.6768ms | 1.4775 KOps/s | 1.4658 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8990ms | 0.6808ms | 1.4689 KOps/s | 1.4514 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.9217ms | 0.6226ms | 1.6062 KOps/s | 1.6750 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7865ms | 0.6070ms | 1.6473 KOps/s | 1.6723 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.8156ms | 19.5862ms | 51.0562 Ops/s | 50.9604 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.5012ms | 19.7317ms | 50.6798 Ops/s | 50.9140 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.7473ms | 19.5007ms | 51.2802 Ops/s | 51.2738 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.7510ms | 19.4871ms | 51.3160 Ops/s | 51.3100 Ops/s | |
test_to_module_speed[True] | 1.0779ms | 0.9388ms | 1.0652 KOps/s | 1.0552 KOps/s | |
test_to_module_speed[False] | 1.2830ms | 0.9272ms | 1.0785 KOps/s | 1.0808 KOps/s | |
test_tc_init | 92.6610μs | 35.2660μs | 28.3560 KOps/s | 29.2644 KOps/s | |
test_tc_init_nested | 0.1668ms | 70.8528μs | 14.1138 KOps/s | 14.0853 KOps/s | |
test_tc_first_layer_tensor | 4.7300μs | 0.7064μs | 1.4157 MOps/s | 1.4253 MOps/s | |
test_tc_first_layer_nontensor | 0.1661ms | 2.3639μs | 423.0369 KOps/s | 432.0422 KOps/s | |
test_tc_second_layer_tensor | 57.0910μs | 1.4404μs | 694.2418 KOps/s | 698.8364 KOps/s | |
test_tc_second_layer_nontensor | 35.2110μs | 3.0912μs | 323.4965 KOps/s | 325.9764 KOps/s | |
test_unbind | 0.2445s | 10.1121ms | 98.8912 Ops/s | 146.3474 Ops/s | |
test_full_like | 10.4107ms | 9.4277ms | 106.0706 Ops/s | 105.2675 Ops/s | |
test_zeros_like | 4.9166ms | 4.3646ms | 229.1175 Ops/s | 230.5094 Ops/s | |
test_ones_like | 4.7033ms | 4.3720ms | 228.7307 Ops/s | 229.7932 Ops/s | |
test_clone | 12.0631ms | 9.3757ms | 106.6584 Ops/s | 152.6799 Ops/s | |
test_squeeze | 0.1534ms | 9.5739μs | 104.4504 KOps/s | 100.4905 KOps/s | |
test_unsqueeze | 0.2142ms | 71.2983μs | 14.0256 KOps/s | 13.8976 KOps/s | |
test_split | 0.3910ms | 0.1600ms | 6.2503 KOps/s | 5.9646 KOps/s | |
test_permute | 0.3452ms | 0.1820ms | 5.4932 KOps/s | 5.3308 KOps/s | |
test_stack | 52.1419ms | 51.5381ms | 19.4031 Ops/s | 19.3841 Ops/s | |
test_cat | 51.9602ms | 51.2580ms | 19.5092 Ops/s | 19.5693 Ops/s |
Stack from ghstack (oldest at bottom):